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SUMMARY 

The  state  of  ait  concerning  ■models  for  recall  aiid;  recdgriitipn  is  reviewed;  A 
distinction  is  made-  between  network  mckldsV.  se^d'ch  or  separate 
and  composite/distributed'  memory  iiiPdeisi  T^e  mpdels  are  compared'  on  a 
number  of  aspects,  such  as:  whether  they  are  based  on  sepmate  or  composite 
traces,  de  nature  of  the  memory  representation,  context  dependency,  etc. 
Following  this  review,  a  comparison  is  made  with  respect  to  the  ability  to  predict 
basic  findings  in  memory  research.  The  report  concludes  with  a  discussion  of  the 
advantages  and  disadvantages  of  quantitative  models  of  ineniory. 
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SAMENVATTING 

Er  wordt  een  ovemcht.gepresenteerd  van  de  st^d  ^  z^eh  met  betiekidng  tot 
mpdellen  vpor  herinneriilg  en  herkeiining.  Er  wordt  eeh  oiiderscheid  ;gemaakt 
tussen  lietwerkmodellen,  zoelqhodelien,  en  modellen  met  een  gedistribueerde 
geheugenrepresentatie;  De-  verschillehde  modellen  Worden  op  een  a^t^ 
aspecten  veigeleken,  zoals  de  vraag  Of  ze  gebaseerd  zijn  op  gescheiden  dan  Wei 
samengevoegde.  geheugensporen,  de  aard  vw  de  geheugenrepresentatie,  context- 
afhankelijicheid,  e.d.  De  modellen  worden  ver^olgens  veigeleken  in  termed  van 
de  mate  waarin  zij  in  staat  zijn  empiris^e  tosultaten  te  ^rldaren.  Het  rapport 
wordt  afgesloten:  met,  een  bespreking  van  de  vbor-  (sn  nadelen  van  :het  werken 
met  kwantitatief  geformuleerde  modellehvoor  hetgeheugen. 


The  stu<^  of  area,  that  h^  been  driven  by 

modelsr.sinces  at  sleasf  the;  .Over 

niodellihg.  h^  ishiftedi  :fiomi  veiy  ^pre  isin^^;  ifexpenmehts  :tp;  what 

mipt^be:  hest‘dei:aibe  ^isemirquantitati^  -fittingipf  afvwde;  variety tof  phehpm- 
eha  ’fipm  a  humber  of  experimehM  -psura^g^.  XcbmpafC;  fpf  :mstitoce ’Bpweifs 
bne-eieirient  model-  [Bp^i;-  i96ll  tp  -any  of  ^^er  current'  models  such  as  ACT' 
[AhdefsPn,,  l983bb  SAM*  .[Raaiji^  Shiffrin,  198i;HGUl^  &;  ShiMihi, 
19»4]-or  ir©rwdVi^[Mur^ 

The  complexities  of  recent  models-  cPntnbutes  to  the  apparent  ;impossibility'  6f 
dedding  between  them.  ^MthoUgh  couched  in  quite- different  terms,,  they  spfteri. 
make  very  similsu:  predictions,  at  least  under  appropriate  choices  of  parameters. 
This<makesrit!difficuU  tO»generate  Critical:  empiricairteris..  Gh?the  other  handj  ;the 
similarify  of  predictiphs.  suggests,  real  . progress  in  theory  development,  .forced  by 
the  necessity  tp  accduntfoir  a  staiidard  aiid  a^ed^upon  coipusf.of  findings. 

In  this  chapter,  we  will,  review  a  number  Of  the  most  important  Contemporary 
ihPdelsrof  memory,  trying  to  ^highlight  the  simil^ties  and  differences  in  ^the  way 
they  handle  basic  facts;  about  recall  and  .recoghition.  Space  limitations  prevent  us 
from,  airy  attempt  at  exhaustive  coverage.  For  the  same  .re^pn,  although  a, 
number  of  models  can  sof  do  predict;  response  latendes,.  we  leave  coverage  of  this 
important  topic  tor  a:  future  chapter. 


2  THEORETICAL  APPROACHES 

Although  aity'  type  of  classification  is  bound;  to  be  unsatisfactory,  we.  will  use  a 
classification  of  current  mpdels  in  three  basic  categories:  i)  separate  trace 
models  involving  spreading  activation,  or  making  no  explicit  activation  assump¬ 
tions;  we  terin  these  network  models;  2)  separate  trace  models  involving  parallel 
activation,  termed  search  or  episodic  trace  models;  and\3)  corhposite/distributed 
memory  models. 


2.1  Neri^rk  models 

Network  models  propose  that  long-term  memory  consists  of  a  set  of  nPdes  and 
links  cpnnecting  the  nodes.  The  nodes  represent  concepts  or  cognitive  units 
(Anderson,  1983a,b)  and  the  links  semantic  or  episodic  relations.  Whenever  two 
items  are  studied  together,  a  link  between  the  nodelirepresenting  these  items 
may  be  formed.  In  most  of  these  models,  a  process  of  spreading  acti^^tion 
determines  the  retrieval'  of  infprmatipn  from:  memory.  Basically,  there  are  two 
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types  of  network  mddM:  (a)^  thfe  dU-or-rione  ^  the? 

cgnti^us  gcti^^ 

^e  f^iprTnoho  actiratipnf^^^m  assumes  that  network  nodes  ^are;  either  a^ye 
or  inactive:  The<bestMq^'  example.is  Ac’ AiPP-®p4ei5p  ^dersoty 

(1976);.  In'  suck  a  inodeli  Jthe;  isprcadmg  :pi . activation  ^is  deterged:  by  ;the 
(relative)?  sttengpth  of  ike:  hpdes:  pr  ffie;  U^irSuppoM  that^  it 

ate  cPnnected  <by  a  lir^  s^ye,  tte  ;prpbabiiityftOf;;activating;;l^^ 

the  next  unit  of  time  is  a  fimctiPii  6t,s/Sy  'the;  relative  stren^  ;pf  the'  link;;/ 
compared  to  all  other  links  emanating  from  or,;  aiternatiwlj^,  Jthe  relative 
strength  pf  node  7  compared  to  all  other  nodekUnked  tp.X  In  suCh  a  mpdel,  the 
probabiUty  ofrretrieyingpy  giVen;thatrXis:active,-is  equid  tp  vthe  likciihppd  that  7 
is  activated  before  a;specifred.cutoff  time. 

A  continuous  activation  model  waa  developed:  by  Anderson  (1983a,b)  as  an 
alternative  to  the  alhor-npne  model  The  basic  difference  is  that  network  ,  nodes 
now  have  a  cpritihuPusly  vaiying;:activatidntstrengthi.  This  means  that  phe;  needs  a 
different  rule  for  determining  whether  a  memory  trace  has  been  successfully 
retrieved;  If  a  stimulus  inode  JT  h^  an  associative  link  to  another  node  7,  some: 
activation  will;spread::from  Jf  to  7i  The  amount  pf:actmti6n  of  7  is  determined, 
by  the  relative  strength  .ofthe  link  between  and  7  (coiiip^ed  to  <ill  other  links' 
from^.  In>such:a  model  it  becomes  more  natiiralito  assume* that  theiprpbability 
and  latency' of  retrieving  the  trace  7;are  a  function  of  the  amount  of  activation, 
of  7.  Thus,  the  notion  of  spreading  activation  has  changed  from,  gradually 
activating  connected  nodes  (i.e.,  distant  nodes  take  longer  to  activate)  to  a 
dynamic  model  in  which  the  activation  spreads  rapidly  over  the  network  but  in 
varying  degrees  (i.e.,  distant  nodes  have  a  lower  level  of  activation). 

As  an  example,  in  the  most  recent  version  of  Anderson’s  ACT  theory,  the  ACT* 
model  (Anderson,  1983b),  it  is  assumed  that  during  storage  mempry  traces 
(called  cognitive  units)  are  formed.  Traces  vary  in  strength  (a  functipn  of  the 
number  of  presentations  and  the  retention/interval),  and  these  strengths  deter- 
ihine  the  amount  of  activation  that  conve'iges  bh  the  trace  from  associated  nodes 
(thus,  in  this  model,  it  is  relative  node  strength,  not  link  strength  that  determines 
fte  flow  of  activation;  it  is  not  evident  whether  this  makes  a  difference).  Thus,  in 
a  paired-associate  recall  situation,  where  the  subject  learns  a  list  of  pairs  A-B,  it 
is  assumed  that  the  trace  (the  cognitive  unit)  encodes  the  informatipn  that  this 
pair  was  presented  in  this  context.  At  test,  the  re.sponse  will  be  retrieved  if  (a) 
such  a  trace  has  indeed  been  formed,  and  (b)  it  can  be  retrieved  within  the 
cutoff  time. 


22  Episodic  trace  models' 

The  basic  characterif.tic  of  episodic  trace  models  is  that  they  assume  a  set  of 
separately  stored  memory  traces  that  are  activated  in  parallel.  Such  models  are 


winfejtow  (^ed  Vearctiiinodefe  l^ 

traow :  onef^^suik^ 

tidn:irpm4bngite^  memory  if^lyes  5Sfc^enti^^^p^  |et  of  tmemoi^^ 

traoesi  4^eif»est:;imb^:  ex^plevof 

Memory  (S^j^^mddbl  pro  R'^jm^ers  1981);  lii 

Si^  the  sapling  prohabili^  of  a^  particular  trace  depend  on  the  ielative 
strehgdi4of  that;tfa^’'bomp^d'td?aU;p&^^ 

The  SAM'  model:  assunies!  'that  dufihg.  stpr^e»  ihfoimatibh  ^is  <rejpresented;  in 
’’memory  iimages?,  which  contam  iteh^  assbciatiV«!  ^^d  conteidual  infprmatiom 
TTie'  ^oimt:  ahd:;tJ^^bf  Jinfpfmation-stpred;  is,  determihedby  cb^g -processes, Jn 
STS.  in  mbst:(intentiohd)  ;leaiiim  the^  ^biint  bf  lnfbi^ 

is  a.function'  of  the  length ^of  time  .that  theritemiis  studied' while  ih  ST^.  Accprd- 
ing'  to  the  SAM  mbdel,  ietriev;^  from  US  is.  frascd  on  cues  .(context,  itemSj 
category  names).  \^ether  an  image  is  retrieved;  or  iibt,.  depends  on  the 
associative  strengths  of  the  retrieval  cups  to  that  image.  These  strengths  me  a 
functions^of  the  overlajp  of  the  cue^  iMbrmation  and:the  infbrmatibhistdfed  in  the 
image. 

An  important  property  of  the  SAM  mbdel  is  that  it  incorporates  la  rule  to 
describe  the  overalLstrength  of  a  set  of  probe  cues  to  a  particular  image:  the 
overall  activation  strength  (A(i))  is;  eqiid  ta  product  of  the  individual  cue 
strengths  (weighted  if  necessary  for  relative  salience  or  impprtance).  This 
multiplicative  feature  focuSes  the  search  process  on  those  images  that  are 
strongly  associated  to  a//  cues. 

In  recall  tasks,  the  search  process  of  the  SAM  model  is  based  oh  a  series  of 
elementary  retrieval  attempts.  Each  attempt  involves  selecting  or  sampling  one 
image,  based  on  the  relative  activatibn  strengths.  Sampling  an  image  allows 
recovery  of  information,  from  it.  For  simple  recall  tasks,  the  probability  of 
successfully  recovering!  the  name  of  the  encoded  word  is  assumed  to  be  a  simple 
function  of  the  weighted  strengths. 

Although  the  S/iM  model  assumes  that  the  process  of  activating  information  is 
basically  the  same  in  recall  and  recognition,  there  are  some  important  differ¬ 
ences  between  these  two  processes.  It  iS  assumed  that  recognition  does  not 
necessarily  involve  sequential  sarnpling  but  is  (mostly)  based  bn  a  daect  access 
process  involving  a  single  retrieval  step  (Gillund  and  Shiffrin,  1984,  p.  55-56). 
ITie  recognition  decision  in  this  direct  access  process  is  based  on  the  sum 
(s%A(fc))  of  the  activation:  strengths;  if  the  same  cues  are  used  to  probe  memory 
for  reci^:  and  recognition,  the  activations  are  the  same  in  both  cases,  though 
used  in  different  ways.  As  we  shsdl  see,  the  process  of  summing  activations 
makes  the  SAM  model  for  recognition  remarkably  similar  in  structure  to  models 
that  appe^  quite  different  on  the  surface,  even  models  that  sum  inputs  at 
storage  rather  than  retrieval  (like  most  composite,  distributed  mbdels). 
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M  reprise  is  :i^e= 

the  ((fetnbutipfr  ^  determines  -perfPi^^  reason^  SAM 

mcprpprateS  specific  van^ce;  gumptions;  in::piiificulai5;'the^:^s^^  deyiatipn 
pf^the  distribution  pi  a  ^veniStren^:is^^i^ed>tQ"j^ipiPpPrt^ 
stren^  vtduP  (GiUimd;&:Shi£friniii9^;;$b^^  i$^). 

The  SAM  mpdel  assumes  that;  for  typical^epi^  cPnte^al^ 

inforinatioh  is  ^ways  encoded  in^the  inempiy  image,  and  conti^^  one  pf  the 
retrieVid  cues,  Mensink  and  RaaijmjJcere  (i^iSiSj 

the  model  :to  handle  time-deperideht  chahgesfin^  t^^  The  basic;  idea^ 
adapted  from  Stimulus  Sampling  ttePiry  :(&tes,  1955)^  is-that  :a  rmdom  fluctur 
atidn  Pf  eleihenisVocCurs  between  two  wts;  a  set' pf .  awulable  Context  elements, 
and  a  set  pf  (temporarily),  unavailable  cohteirt  elements:  Perfonhmce  iis,  a 
function  of  the  relationship  between  sets  of  available  eleihents  at  different  points 
in  time  (viz.  study  and  test' trials). 

Hihtzman  (1986,  1988)  developed  a  model  for  episodic  memory  that  bears 
similarities  to  Gillund  and  Shiffrin%  SAM  model  for  recognition.  This  model, 
MINERVA  2,  has  been  applied  primarily  to  category  learning  and  recognition 
memory.  It  is  assumed  that  each  experience  produces  a  separate  memory  trace. 
Both  items  arid  memory  traces  are  represerited  as  lists  of  features  or  vectors.  Iri 
simulations  of  the  model,  it  has  been  assumed  that  each  feature  is  lridependently 
encoded  with  prPbability  L,  a  leamirig  rate  parameter.  When  a  probe  cue  is 
presented,  all  memory  traces  are  activated  in  parallel.  The  amount  of  activation 
of  aity  particular  trace  is  a  nonlinear  fimction  of  the  similarity  to  the  probe  cue. 

As  in  the  SAM  model,  recPgnition  perfonriance  depends  on  a  single  value,  the 
summed  activation  of  all  traces,  In  order  to  allow  recall  to  be  carried  out,  the 
model  also  stipulates  that  a  vectpr  is  retrieved.  This  vector  (called  iht  echo)  is 
the  sum  of:  all  trace  vectors,  each  weighted  by  its  activation  value.  Because  of  the 
weighting,  and  the  nonlinear  activation  rule,  the  echo  will  contain  a  dispropor¬ 
tionate  representatiPri  of  those  traces  similar  to  the  memory  probe.  Thusfif  part 
of  trace  j  is  used  a  a  probe,  the  echo  will  contain  a  strong  representation  from 
the  entire,  trace  j.  For  example,  if  a  trace  encodes  a  studied  pair  A-B,  md  A  is 
used  as  a  probe,  the  echo  will  contain  something  similar  to  v4-B,  allowing  J5  to  be 
recalled.  Of  course,  the  retrieved  trace  is  actually  a  composite  of  maiy  traces 
(unlilre  the  SAM  mpdel),  so  spme'mechanism  is  needed  to  extract  some  particur 
lar  item  from  the  composite  --  Hintzman  (1986, 1988)  discusses  . several  possibil¬ 
ities,  such  as  comparing  the  echo  to  the  stored  traces,  or  repeating  the  retrieval 
process  several  times,  each  time  using  the  retrieved  echo  as  a  probe^  until  the 
echo  achieves  a  stable  value  (usually  matching  some  stored  trace).  In  aiy  events 
one  basic  difference  between  SAM  and  MINERVA  2  is  that  the  latter  model 
assumes  that  in  recdl  a  kind  Pf  composite  meniory  trace  is  retrieved  (at  least 
initially),  whereas  the  SAM  model  for  recall  bolds  that  a  specific  memory  trace 
is  sampled  (initially,  though  different  traces  may  be  sampled  subsequently). 


K  'rc<»ntv;yea^.  cpm]psite/4istnbuted"ffiei^^ 

gip^jg  I)6puianty.^(^^  refer  reader  to:.  ;a;  recent* 

ctiapterjihviwitta/jJto^  by 

blit  .somfewhat^f^elent cl^^s,vln^6nc^d^  iteimi^= 

in  MINER\^  i)j  of  matiices  ofielement^feat^  cpnsists^pf 

a  suOT  of  the  ye^ofs  or  matrices  (e;g.  TOIMM:  [Murdock^  1982], 

[Mettle  Eic^  1982;  1985];.  J^esi  Ahdef^d^K'^^of  mpdel  jjAndersoj^  ,19^^^^ 
the^^Ma^ "rrtodel  [Pikfii  1984;  ifllumphr^;  B^;&  Pilfe,  i98^];;I(^eryaV  SDM 
mPdel  :[Kanervt4  1988]  :Mswp£u^{^  :bet^en /tins  class^  a^^  stpragei 

cli^s  of  the  previous  ;^ctipn):  In^^  ^^  Mtond;  cl^jhmeinory*^  consists  of  modes 
cohneded  by*  weij^t^  links;  items  sit.  represented  1)y  a  i  pattern  of'  sd  of 
activations  pf  :the  hPdes,  and  longrterm^memPry  cpmis^  of  the.  yalues  of  ithe 
weights;  on  the  links:  (evg.,  Grossberg’s  ART  model-  [Grossberg,  1987;:  Grpssberg 
<SL  Stone,  1986];  JanieS  Amderson’s*  BSB^  m^  [JA  Amderspn  et  alij,  .1977],  or 
any  of  the  feedforward^  backpropagatiph  models;  McClelland;  and:  Ruinelhart’s 
recurfentimodel). 

The  basicidifferehceTbetiwen  such jmbdeis?and'the:  models  disCTissed'preyiously  is 
that  compotite/distributedimemPiy  ni6dds:^sume  'that;a  memory  trace  is  not:  a 
distinct,  locdized  entity  but  rather  p^\of  a  combination  or  superimppsition  of 
all  traces  input  to  the  system,  it  is  this  aspect  that  has -made  maiy  of  these 
models  seem  both  n^steribus  to  the  novice  (who  wpriders  how  memory  can  be 
as  good  as  it  is),  and 'attractive  to  many  experts  (who  can  explain' wlty  memory  is 
as  bad  as  it  is,  and  how  we  can  extract  averages  and»  prototypes,  from  inputs,,  and 
who  like  the  analpgy  to  neuronal  structures). 

These  cbinposite/distributed  storage  assumptions  can  serve  as  a  basis  for  a 
memory  model  because  for  each  version  there*  exists  an  appropriate  retrieval 
operation.  In  some  cases  the' cue  will  retrieve  a  noisy  version  mf  the  original 
trace  containing  that  cue;  in  other  cases  the  cue  will  retrieve  a  noisy  version  of 
an  item  originally  stored  as  an  associate  of  the  cue;  in  yet  other  cases  the 
retrieval  may  be  a  clearly  definable  response,  but  with  a  type  of  noise  determin¬ 
ing  the  pipbabilify  of  reaching  such:  a  state,  and  determining  whether  the  state 
would  be  the  correct  one.  The  retrieved  information  can  be  matched  against  the 
input  to  perform  recognition,  or  if  neceSsary  can  be  -cleaned  up’  imsome  fashion 
to  allow  a  response,  to  be  . emitted. 

As  an  ex^ple,  consider  one  version  of  the  Matrix  ;model  proposed  by  Anderson 
et  ali  (1977;  termed;  BSB  for  ’brain  state  in.a  box’).  Whenever  two  items  (fj,  :gi) 
are  associated,  a  matrix  Aj  is  prbduced  with  cell  elements  Ai(r,s)=fi(r)gi(s).  The 
composite  memory  (M)  consists  of  the  sum  of  all  such  association  matrices, 
M  =  s  Aj.  Ignoring  for  simplicity  the  details  of  the  node  activation  process  (such 
as  its  nonlinear  limitations  on  activation  growth),  the  retrieve  of  an  ^sbciate 
(&)  given  a  cue  item  (fj)  can  be  obtained  by  postmultiplyihg  M  with.fi:  the  .result. 


12. 


Ml^  isia  noi^  com^ite  of  tiipse  V  that7jiad«b€epstud(^,ft^  ands 

iteins;  siMaf  to  fj  (see^^M  W%  pi  Ml)*  tlus  modelhis 

formulated  re^lji  a  :Simpje  imo^c^^  3^: 

hancUe;  re0;^tipn  ,  mem^^  >basic;  ;idea  is;  ffiat  ?re.w^tidnv  |^^ 
matcWngvPt^rationr.oiifte  :cpm|k)Mte  imemoiy  ittace  ^th  stKer^ 
item.  ?In  o: J|r  for  itWi,  tpi  vvfpri^.  it  miKt;  be  jassiumediitfiat  ;ffle  vinempiy  strace^ 
inciudestnotrpnly;  associative  iinfonnatidnibut;^  . 

Although  we;  haw^^  the  shprt^tenh;activati6n  fe  of  sthc  BSBjsmpdpb 

it  is  atmember  ofjpbr  second  claM.  We  begin  pur  discuMipm 
a-mpdei  clbsely';related  to  Andejrsonh'butMthout  a  node  abtivatibn^pi^ 
its;  nonlinear  limit  oh,  :acti^^tioh  values),  M>.is  4  the  Matrix  model  p^ 

Pike  (1984;  sec  also  Humphreys,  Bain,,  and  ^  1989,  and  Humpluttj^i  pike, 
Bain,  &  Tehapj  1989),  Associations:  of  item  veetprsVare  repie^nted  by  nt  Ices 
tp  ifonn  a  comj^site  ;memoty  matrix.  In;  recent  versions  pf  this 'inodel,  context-to- 
-item  associations  have  been  incorporated  in  the  matrix;  :mbdel  in  order  to 
account  for  the  fact  that  mempry  of  ;a  tb-be-memiorized  list;  is  to  some  extent 
"isolated”  frbm  all  other  memories.  Thus,  instead  of  storing  a  two-way  association 
between  the  members  pf  a  paired  associate,  a  three-way  association  between  the 
two  items  and  the  'Cohtext,..is«stored  (in  toe  fonn  of  a  3-tomeristonal  matrix).  In 
order  to  retrieve  gj  the  memory  matrix  (M):  is  multiplied,  in  a  spedfic  way  (see 
Humphreys  et  al;,  1989)  with  the  matrix^obtained  by  multipUcatibn  of  the  context 
(x)  and  Item  (fj)  vectors;  The  latter  product  defines  the  "interactive"  retrieval,  cue 
representing  the  assodation  of  context  iand  stimulus  item;  This  incorporation  of 
contextual’  a^odatipns  makes  it  possible  tp  distinguish  between  episodic 
(list-spedfic)'and  semantic,(preexisting)  assodations. 

Related  models  have  been  propbsed  by  Murdock  (1982)  and  Metcalfe  Eich 
(1982;  1985;  see  also  Metcalfe  ;&  Murdock,  1981).  In  both  Murdockb  Theory  of 
Distributed  Assodative  Memory  (TODAM),  and  Metcalfeh  Composite  Holo¬ 
graphic  Astodative,  Recdl  Model  (GHARM)^,  the  assodative  encoding  .and 
retrieval  operations  are  the  mathematical;  pperations  bf  convolution  and  correla¬ 
tion,  respectively*  (see  Metcalfe  Eich,  1982,  1985). 

The  TODAM  mpdel  assumes  that  when  each  assodation  A-B  is  studied,  the 
veetprs  repretonting\.4:  and  B,  and  the  convolution  vector  representing  A-B,  are 
all  added  to  a  slightly  decayed  version  bf  the  single,  cbmppsite  memory*  vector 
that  contains  all  of  episodic  memory.  In  this  model;  recognition  involves  match¬ 
ing  the  to-be-recognized  item  vector  to  the  memory  vector  (i.e.,  taking  the  dot 
product)  and  using  the,  ^sresultihg  ^ar  number  as  a  measure  of  familiarity. 
Recallystarts  by  correlatinglhe  cue  item  vector  with  the  memory  trace,  prbdudng 
a  nbisy  vector  cont^ng  coraponients  representing  versions  of  all  items  assod- 


^  The  term  "hoiographic"  refers  to  the  andogy  between  the  properti^  of  human  associative 
memory  and  those  of  holo^ams  (Pribram^  Nuwr,  &,  Baron,  1974;  WiUtoaw,  1981),  in  p^ci^ 
their  resistimee  to  local  damage  and  the  associative  properties. 
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atedtt6>  the  OTryector  vectpi:  Amt  ihen%  cleApd  tip  to 

produce  a  response  (^,1^  wmptog'the  AWeved  ve^pr  to 
stored  vectors  lepresenid^  ite|A  m^eniantic 


Rajffier  tbA  store  vectors,  (Metc^fe  Eich,  1985)  stores  autjpcPnwiu- 

tibn  vectors  for  each  sin^e  item;  these  are  stored  sdphg  with  the  cpnypiution 
vertpr  for  the?^  stodied‘!pmi^  Aere^y  suipiedvmtp  a 

sih^e  composite  memory  veto  lie  retdet^i  .p^ratiA^^  as  in 

T6Et\M.  to  GHAlRM,  a^  -A'  retneves  a  cpin^sitc  hpi^  versioh  pf  all 

items  convoluted  with  y4,.inciuding;>4  itself,;^  item  ^d  t^oCiatjye  infoiiAa- 
tiru  are  hot  independently  retrieved  (m  they  are  in  TODA^^  As  liisu;^  the 
tri  ce  must  ire  cleaned  up  to  generate  a  respphse  ih^  task.  Reco^tiPh 

cA  be  ‘accomplished  by  cpmparihg  the  retrieved  vector  tp  the  test-yectbr. 


The  second  Class  Pf  CPmposite/distributed  model  explicitly  incorporates  pro¬ 
cesses  of  node  activation  (often  thought  of  as  short-term  iheihory)  as  \rell  m 
weight  mpdificatipn  (the  set  of  weight  rej^resenting,  long-term  memory);  both 
processes  ^ically  being  nonlinear.  The  complexities,  introduced  have  led  most 
investigators  to  explore  these  models- in  the  iprm  Pf  computer  simulations  (with 
the  notable  exception  of  James  ^derson  and  Steve  Grossberg  ~  see  below). 
Such  models  are  often  described  by  the  terms  ’connectipnist’  or  ’rieur^  net\ 
Most  of  the  applications  have  been  to  learning  phenomena,  categorization  and 
classification,  or  perceptual  phenomena,  but  some  discussion  of  applications  to 
memory  is  useful. 


Consider  first  a  representative  back-propagation  model  (Ackley,  Hinton,  & 
SejnPwski,  1985;  Rumelhart,  Hinton,  &  Williams,  1986).  this  model  ^sumes  a 
3-layer  representatidh:  a  layer  of  input  units  or  hetvrork  nodes,  a  layer  of  output 
units,  and  a  middle  layer  of  ro-caJled  hidden  units.  Activation  is  fed  from  the 
input  units  to  the  hidden  um’ts  (using  a  nonlinear  transform),  and  from  these  to 
the  output  units.  All  connectipns  between  layers  have  weights  that  deteimine 
how  much  the  activation  of  a  pafticiilaf,  say,  hidden  uiiit  depends  on  the 
activation  of  a  particular  input  unit.  The  basic  rule  of  the  back-prppagatioii 
model  is  that  these  weights  are  adjusted  during  training  in  prder  to  optimize  the 
correspondence  between  predicted  and  actual  output  veetPrs  (the  back-propaga¬ 
tion  ajgPrithm  performs  a  land  of  least-squares  fitting  procedure).  One  caii  use 
such  a  model  to  perfoim  recognition  and  recall,  in  a  number  of  ways;  perhaps  the 
simplest  is  io  have  each  input  assrreiation  attempt  to  reproduce  itself  at  the 
output  layer.  Then  a  Absequent  teJt  with  an  item  will  tend  to  produce  a  noisy 
version  of  the  a^ociation  containing  that  iteni  at  the  output  layer.  Reco^tion 
can  be  acePmplished  by  matching,  md  recall  by  cleaning  up  the  trace  in  some 
fashion. 


It  has  been  shown  that  thessj  networks  can  represent  virtually  aiy  computable 
mapping  from  input  to  output  layer  (given  enough  hidden  units).  However,  for 
our  purposes  the  important  issue  is  the  way  in  which  such  a  mapping  is  learned 
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^d  .iet^ed  cdhdtibi^,.  been  considered ::b^ 

Mi^q^y  andl€phen^^^^(^^^ 

M^oskey-  and^Gohen  0989)5sho^d  diatim-two^  feacl^ippa- 

gation  -model:  st^ers^  ^^m  "cati^tiopUc  fo^et|^>:  the  second  list  .  10^:  to 

jtopst  coinplete  fptgetog  pftte  Ifet/Si^ 

Ratcliff  (1990).  who  an^yzed  the,  .inpdel^f:]^i^di^pns  for:  lera^tion  .inenipry, 
This  result  is  underetandablp  if  it  ;is  reaiize.d^^^ 

the  >^imectlpn«yOii^^'^b^^^ -^  Ihost^iheceht^^  isia^umed  Aat 

the  inputs  dming^tilie  secpnd  list:  die:  of;  ^cphd^  list  iteim  only.  ^  Ae  start  of 
second-riist  iearmng,  die  weights  Mill. 'b^  cpj^gured  optiin^y  for  the  l^t. 
Hpwever,:  there  is  no  mechamsm':ih  ^e  mpdel:  that  w^  Jteep;  dte  weighte  ffpm 
obtaining  completely  differeht;  v^ues,  pptim^g  the  "recail"  of  die  second-list 
items.  Hence,  affer  a  few  traimng  tri^s  on  the  second  list,  the  netiwrk  will  have 
"forgotten”  the  first  .  list;items;  Ratcliff  (1990)  jdso;shpwed  that  thfe  .mpdel  M 
predict  a  positive  effect.„of  ampurit  of  learning  on  the  d’  ^measure  for  tecpgnidon. 
(Ratcliff  also  showed  that  several  related  mpdeis,  including  the  auto-a^odadve 
mpdel  proposed  by.  McClell^d,  and;  Rurne^  1985  see  below  Med  tp 
resolve  the  .problems.  Reseafch^going  :pn  at  the  tinic  of  this  writing  suggests  a 
number  of  new  approaches  Ihai  might  work;  e.g.  .Sloman  and  Rumelhart,  in 
press;  Kruschke,  in  press;  Lew^dowski|,  in  press.  Beilow  we  shall  discuss  the 
ART  model  of  Grossberg  that  deals  with  the  problem  in  a  rather  explicit 
fashion). 

The  backpropagation  models  are  ’feedforward’  networks:  activation  flows  only 
forward  through  the  system  (the  amount  of  error,  is  in  a  sense  propagated 
backwards  through  the  system  in  order  tp  adjust  the  weights  appropriately,  bi’t 
this,  should  not  be  confused  with  Ihe  flow  Of  activation),  On  the  Other  hand,,  a 
number  pf  models  are  recurrent:  arti\^tion.  that  leaves  a  node,  can  be  fed  back  to 
that  same  node,  po^ibly  ^er  fiowing  through  a  number  of  intermediate  nodesi 
and  the  process  typically  cpntinues  until  a  stable  pattern  pf  activation  results. 
(The  BSB  model  has  this  character,  though  we  did  not  discuss  the  dynamics  of 
activation.) 

Consider  first  the  McClelland  and  Rumelhart  (1985)  model,  hi  brief,  a  set  of 
nodes  accepts  input  ^ni  external  sources,  and  is  folly  interconnected  (except 
that  nodes  dp  not  directly  activate  themselves)  by  directional  links  having 
weights.  Activation  moves  through  the  ^stem  driven  by  the  sum  of  the  external 
and  internal  inputs  to  a  node,  until  a  stable  pattern  .  is  reached.  'Ihen  weights  are 
adjusted  so  as  tp  reduce  the  difference  between  the  internal  and  external  input 
to  each  cell  (so  that  the  network  will  try  to  reprpduce  its  external  inputs). 
Recognition  can  be  accomplished  by  matching  an  input  to  the  stable  pattern  of 
activation  it  produces,  and  recall  by  cleaning  up  the  same  stable  pattern  in  some 
fashion. 
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FiiMy  w^wider;  the  recurrefit }  ART  models  of:  Grossbetg  (evg;.  Grpssbeig;  and 
Stone;  IpM  ai^ou^  'the ,  piigih|d  .mod^  date  back;  before  1976);:  iWe  will: 
describe  a  ^eatly  simplified  ;versipn  of  the  'fteOty  tpn^ve  the  flayor  of  :the 
approach.  Mempiy  consists  pf  a  series  Of  Ordered  layers  6j[  hpdes.  Cpnsider  just 
two  layers,  with  perceptual  inputs  to  layer  1,  (arid,  in  general,  tpp-down  iriptits  to 
layer  2).  In  addition,  weighted  links  exist  inr.bbth  direptiops;  between  nodes  :in  the 
rivo  layers,  and  activations  pass  in  both  directions  along  these  links.  Within  a 
layer,  there  may  also  be  corinections,  but  these  are  inhibitpfy  and-  dp  not  carry 
activations  directly.  The  two  layers  -pass  activation  rapidly  back  and  forth  until  a 
stable  result  is  achieved;  Because  of  irihibitiOn  within  l^er  <2,  a  single;,node  will 
come  to  be  active  in  this  layer,  iri  stable  resonance  with  a  pattern  of  activation 
on  the  nodes  in  layer  1.  The  stable?  pattern  may  be  used  for  reco^tipn  or  recall 
in  vW£^  similar,  to  those  we  have  discussed  already. 

A  particularly  notewortlty  feature  of  the  model  is  its  method  for  picking  the 
sin^e  active  node  in  layer  2.  The  pattern  pf  activations  sent  down  from  this  node 
to  layer  1  is  compared  with  the  pattern  in  layer  1.  If  these  do  not  match  well,  the 
currently  active  layer  2  node  is  turned  off,  the  ^stem  resets,  and  a  new  node  in 
layer  two  wins  the  competition.  This  continues  until  a  good  match  is  found,  until 
a  node  not  yet  used  as  a  template  for  a  pattern  is  found,  or  until  no  nodes  are 
left,  in  which  case  all  layer  2  nodes  become  inactive.  The  result  is  that  different 
patterns  are  assigned  new  nodes,  and  new  learning  does  not  harm  old  learning  in 
the  destructive  fashion  of  other  models  of  this. class. 

The  ./eights  on  the  links  change  continuously  also,  but  at  a  much  slower  rate 
than  the  activation  changes.  Ibe  upward  weight  changes  are  made  so  as  to 
reduce  the  difference  between  a  weight  itself  and  the  signal  passed  upward  along 
that  link.  Thus  a  set  of  weights  leading  to  a  single  active  node  comes  to  correlate 
with  the  activation  pattern  in  the  nodes.  Also  the  downward  links  from  the  active 
node  are  adjusted  to  match  the  activation  pattern  in  the  layer  1  nodes,  so  tliat 
top-down  templates  of  the  presented  patterns  are  learned.  The  weights  leading 
to  and  fiom  ai^  one  layer  2  node  come  to  encode  a  set  of  highly  similar 
patterns,  so  that  each  node  in  layer  2  can  be  thought  of  as  a  category  prototype. 
A  particularly  notewortly  feature  of  this  system  is  the  fact  that  the  system  can 
have  a  distributed  representation  at  some  levels  (e.g.  level  1)  and  a  potentially 
separate  representation  at  other  levels  (eg.  layer  2). 


2.4  Differences  and  similarities 

In  this  section  we  will  compare  the  various  models  on  a  number  of  important 
theoretical  dimensions.  Although  the  various  approaches  that  we  have  con¬ 
sidered  are  superficially  quite  different,  basic  phenomena  are  often  explained  in 
a  similar  manner,  albeit  using  different  terminologies..  This  section  will  focus  on 
the  basic  issues  concerning  the  conceptualization  of  memory  processes.  This 
section  vrill  focus  on  basic  issues  concerning  the  conceptualization  of  memory 
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prpces^.  dfecussion' wUl  de^fwith,i^es>^^^^  but  it 

isiimpdr^t^p  remember  ^at  n6'one  bfrthejl]^these.s?dis^sed5beiq^^ 
ri^t  br  '^ng  in  isolation  :'-*each  imust  be  ;and5TOd  instheiCbn^^^  .of  the  model 
in  wiiichif  isiembeddedi 

Ijitl  Sepc^e  vs  composite  Memoryit^ 

The  question  here  is*  whether  the  model  a^unies  that  ^  are  stored 

in  separate  traces  or  in  one  composite^  memory  trace.  Whether  this  is  a  meaning-r 
ful  theoretical  distinction  depends  bn  :a  number  of  auxiUary/  assuinptions  in  the 
models  in  question.  For  example,  Shiffrin  and  Murnane  (in  pre^,4a)..showed  that 
an  arbitrary  number  bf  events  can  be  stofedin  a  single  himiber’^pn  a  single;  link 
in  a  way  that  allows  each  event  tb  be  retriewd  without  error,  The  methbd  is  hot 
a  pl^sically  realizable  one,  however.  Plausible  composite  systems,  incorporating 
the  equivalent  of  neural  noise,  all  seem  to  have  at  least  one  testable  property: 
When  the  system  is  densely  composite,  then  the  storage  of  new  inputs,  or  even 
the  repetitions  of  old  inputs,  tends  to  degrade  the  representations  of  other  old 
inputs.  Ratcliff,  Clark,  juid  Shiffrin  (1990)  tested  this  notion  empirically  and 
found  that  repetitions  of  some  list  items  did  not  reduce  recognition  performance 
for  other  list  items  (see  also,  Mumane  and  Shiffrin,  1991). 

Shiffrin,  Ratcliffj  and  Clark  (1990)  looked  at  the  implications  for  extant  models. 
All  then  current  models  were  found  wanting.  They  concluded  that  composite 
models  dense  enough  to  predict  forgetting  caused  by  the  composition  property 
could  not  predict  the  findings.  They  concluded  that  models  positing  separate 
traces  had  the  potential  to  predict  the  results,  and  developed  a  variant  of  the 
SAM  model  that  did  so.  This  variant  assumed  that  repetitions  were  accumulated 
in  a  single  trace  (a  kind  of  local  composition  hypothesis-see  below).  It  also 
incorporated  a  ’differentiation’  Itypothesis:  Suppose  two  different  items  A  and  B 
were  not  rehearsed  together;  if  B  stored  in  memory  more  strongly,  then  will  A 
used  as  a  cue  will  tend  to  activate  it  less. 

A  more  local  composition  issue  concerns  whether  two  separate  presentations  of 
a  given  item  are  encoded  separately  or  in  the  same  trace.  That  is,  if  an  item  is 
repeated,  does  the  second  presentation  lead  to  a  strengthening  of  the  originally 
formed  trace,  or  will  a  new  trace  be  formed? 

MINERVA  2  assumes  that  each  separate  encoding  of  a  single  item  (repetition) 
leads  to  a  separate  episodic  memory  trace,  ACT*  assumes  that  repetitions 
strengthen  a  given  trace,  and  the  early  versions  of  SAM  were  somewhat  ambigu¬ 
ous  about  this  point.  Recently  (see  Raaijmakers,  1991),  the  SAM  model  has 
been  extended  tb  deal  specitic^ly  with  the  effects  of  repetition  and  the  spacing 
of  repetitions.  In  this  version,  a  kind  of  study-phase  retrieval  assumption  has 
been  added  to  the  model.  That  is,  on  the  secbnd  presentation  an  (implicit) 
retrieval  attempt  occurs.  If  the  trace  representing  the  first  presentation  is 
retrieved,  it  is  assumed  that  the  new  information  will  be  added  to  the  "old" 
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(4^);  had^to  assume*  "a  sti^ngth^^^^  a^:4p-e^lmn~tfie-1ack;  of  a  ;list-strength- 

ih  iecb^tioh:  This  view vgamedpsu^it  ,^id;  a.stucfy  ^by  MtirnMe:  ^d; 
Shiffriii  (499i).  ®hey  ?tried^4d,  mdiice  sepmate ‘Storage  of  a  repeated  wbrd«  by 
embedding;  it  in  “differeht^^^^  manipulaitipm  prpduwd  the 

ejected  ^^sitiye  ii$t-strength  effect  to  a  dittefent  sort 

suppoi^g  this  view  arises  fibni  a  study  by  koss.  &  LMdauer  (19.7^^^  ^ey* 
showed  that  the  trMtion^  spacing  effect  only  occurs  for  the  probability  of 
fecailihg  single  items  pieseiited  tw^  :and^^  the  probability  of  reccing 
either  oiie  of  two  different  iteihs  each  presented  once.  TTiis  result  seeihs  4b 
require  that  repetitions  of  an ;itemishbuld  be  tfeated^differently' torn  the*  case  of 
midtiple  items,  each  presented  oiice  (although  firm  cohdusions  depend  bn  the 
details  of  each  model). 

2.4.2  Representational  issues 

We  will  consider  three  representational  aspects:  (1)  the  nature  of  the  informa¬ 
tion  encoded  iii  the  memory  trace,  (2)  whether  links  between  memory  traces  are 
assumed,  and  (3)  the  representation  of  "assodative  strength". 

The  models  that  we  have  considered  differ  in  their  assumptions  about  the 
information  that  is  encoded  in  the  memory  trace.  In  the  all-or-none  activation 
model  ACTE  of  Anderson  (1976),  storage  of  a  simple  pairwise  association 
involves  the  formation  of  a  new  liidc  between  pre-existing  network  nodes.  In  the 
ACT*  model,  what  is  stored  is  a  cognitive  unit  representing  the  episodic  experi¬ 
ence.  It  is  assumed  that  such  a  new  network  node  has  associative  links  with 
nodes  representing  the  constituent  parts  of  the  item,  i;e.,  (in  this  case)  stimulus, 
response,  and  list  context.  In  ACT*,  assodative  strength  is  represented  simply  by 
the  strength  of  the  memory  traces.  As  described  above,  these  strengths  deter¬ 
mine  the  amount  of  activation  that  spreads  to  the  trace  from  associated  nodes. 

SAM  and  MINERVA  2  also  assume  that  the  trace  represents  the  "episodic 
experience"  but  are  less  spedfic  about  the  exact  nature  of  what  is  stored.  The 
original  SAM  model  focused  on  the  relation  between  cues  and  images: 
assodative  relations  are  represented  by  a  "retrieval  structure"  rather  than  the 
more  traditional  "storage  structure"^  The  model  does  not  make  use  of  explicit 
assodative  connections  between  images,  thou^  these  are  present  implicitly  in 
the  following  sense:  suppose  two  items  are  studied  together;  when  one  is  used  as 
a  cue  the  retrieval  strength  to  the  image  of  the  other  is  high.  SAM  was  not 
entirely  explidt  concerning  the  nature  of  the  "image"  though  for  most  verbal 
studies  an  image  was  based  on  the  individual  word.  Shiffrin,  Mumane,  Gronlund, 
and  Roth  (1989)  presented  evidence  that  a  good  deal  more  flexibility  is  needed, 
and  that  a  sentence  is  often  a  single  image  (and  that,  under  some  circumstances, 
a  pair  bf  words  is  a  single;  image),  Thus  in  prindple,  a  pair  assodation  could  be 
stored  in  two  ways:  separate  images  governed  by  an  implidt  assodation  that  is 
represented  in  the  retrieval  structure,  or  a  single  combined  image. 
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during  retiieval  that  lets*  die  system  distoguish  whether  hvo  ..stored;  items  me*  m 
the  samedrjdiffererit;  traces. 

Finally,  what  about  the,  models  with  comppsite/distributed  representations?  %e, 
final  fepiesentatipri  is  generaUy  a  yectof  dr.'mat^  aiidfis  a  cpmposite  of  isimilar 
yectorsnr  matrices  (or  degraded  fpnns  of  these)  stored  for  indiyidual  items  and; 
pairs.  The  questiPn  is  the  way  m  which  associations  and  single  iteins  are  handled 
during  storage.  TODAM  has  item  yectors,  and  conyolutions  of  item  yectprs  for 
associations.  Context  information  could  in  principle  be  part  of  each  yector,  but  in 
recent  work  has  been  treated  as  a  separate  yector.  CHARM  treats  single  items 
as  autoconvolutions,  but  is  otherwise  similar  to  TODAM  in  most  respects.  The 
Matrix  model  treats  indiyidual  item  yectors  separately,  .  and  context  as  a  separate 
vector.  Single  items  are  stored  :as  an  assodation  matrix  made  fi*om  the  item, 
context  and  a  unit  vector.  Pairs  ,  are  matrices  made  fi:om  the  .product  of  the  two 
item  vectors  and  the  context  vector.  One  issue  left  unresolved  for  these  models  is 
the  basis  by  which  some  types  of  information  are  encoded  in  a  given  vector^ 
while  other  types  are  singled  out  for  treatment  as  a  separate  vector.  For 
example,  how  would  category  information  be  treated?  (see  Humphreys,  Wiles  & 
Bain,  1991,  for  one  possible  solution). 

A  more  general  solution  to  this  problem  is  possible  if  the  various  types  of 
information,  and  various  items  to  be  associated,  are  all  treated  as  components  of 
a  single  vector,  or  single  pattern  of  activation  across  a  set  of  nodes.  For  example, 
in  the  McClelland  and  Rumelhart  autoassociative  recurrent  model,  and 
Grossbeig’s  ART  models,  all  items  to  be  associated,  and  related  information,  are 
encoded  as  a  single  vector  or  pattern  of  activation  values  sent  to  a  set  of  nodes. 
Anderson^s  BSB  model,  and  various  versions  of  feedforward  backpropagation 
models  use  either  of  two  methods.  In  one  method,  similar  to  those  in  the 
recurrent  models  just  mentioned,  items  to  be  associated  are  encoded  together  in 
a  single  input  vector  (for  example,  the  model  of  Ackley  et  al.,  1985,  tries  to 
reproduce  at  the  output  layer  the  vector  presented  to  the  input  layer).  All  such 
models  use  a  pattern  completion  property  to  retrieve  associates.  In  the  second 
method,  the  items  to  be  associated  are  treated  as  separate  vectors;  for  example, 
the  input  layer  coiild  encode  one  item  and  the  output  layer  could  encode  the 
associated  item  (J.A.  Anderson  et  al.,  1977). 

2.4.3  Contextual  encoding 

Any  model  that  is  designed  to  explain  data  from  episodic  memoty  experiments 
must,  somehow  account  for  the  fact  that  a  paired  associate  item  such  as  apple- 
engine  can  be  learned  despite  the  presence  of  strong  competitive  semantic  associ- 
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It  therefore  iseems  highly  desifablevfof  a.inPdel  of;  memoiy  to  have^some;  mea^ 
of  selectively  accessing  memory,  tracer  stored  under  parti^ar  tempor#  or 
contextual  conditions.  Npte  that  a  simple  recehi^-based  mechahism  vvdll  hot 
suftice-subjects  can  access  dnfoimatioh  ^m  rcontexto  identiiSable  peripds  in 
the  past  Tp.give  just'one  example,. subjecte:  are  able  to.'access;selectively  not  the 
list  learned  most  recently  but.  the  list  learned  prior  to  that  (Shiffrin,  1970; 
Anderson  &  Bovver,  1972). 

Most  models  incorporate  contextual  associations  ,  as  the  means  to  focus  retrieval 
processes  in  episodic  memory,  either  by  including  the  contextual  infonnation  in 
the  memory  trace,  or  by  treating  the  contextual  information,  as  a  separate  item. 
Whether  this  alone  will  suffice  is  an  open  question*  For  example,  ACT*  and 
MINERVA  2  assume  an  additive  rule  for  combining  the  associative  strength  due 
to  context  and  item.  Such  a  rule  may  not  have  a  sufficiently  strong  focusing 
effect  to  eliminate  strong  interference  by  preexperimental  associations.  The 
multiplicative  combination  rules  used,  for  example,  in  SAM  and  the  Matrix 
model  are  such  that  retrieval  is  focussed  on  those  traces  (or  those  components  of 
the  composite  trace)  that  are  consistent  with  the  context  at  test.  Even  a 
multiplicative  rule  may  not,,  by  itself,  be  sufficient  to  focus  retrieval  properly.  For 
example,  Humphreys  et  al.  (1989)  call  attention  to  crossed-associates  lists,  in 
which  the  subject  is  asked  to  learn  pairs  like  doctor-king  and  queen-nurse. 
Versions  of  the  SAM  model  in  which  individual  words  (but  not  pairs)  have  a 
single  (semantic)  memory  representation  would  not  easily  predict  the  learning 
seen  in  such  cases.  However,  SAM  models  typically  assume  that  images  are 
episodic  in  nature,  not  semantic. 

It  should  be  no  surprise  that  models  that  do  not  incorporate  context  will  not  fare 
well.  For  example,  a  model  that  does  not  include  a  way  to  reduce  the  effect  of 
irrelevant  associations  will  have  serious  problems  explaining  why  the  interfering 
effect  of  the  number  of  items  on  a  single  experimental  list  is  not  completely 
swamped  by  the  millions  of  previously  acquired  associations.  A  simple  forgetting 
assumption,  i.e.,  a  reduction  of  strength  for  previous  associations  (as  in 
TODAM),  will  not  do  the  job  without  added  assumptions  about  context;  The 
strong  empiricd  list-length  effects  would  require  too  rapid  and  massive  forget¬ 
ting. 

2.4.4  Storage 

The  issue  we  focus  on  here  is  the  predicted  effect  of  increasing  study  time  for  a 
list.  TODAM,  CHARM  and  the  Matrix  model  provide  examples  of  models  in 
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Murdock,  1989)  have  propbsed:  a  prbb^ilistic  enco^g  ^sumptibn:  Each 
feature  bf  an  item  is-  encoded^  (stored)  with  a=:  probability  P'  that  mes  with 
presentation  time.  If  nbt  -stored'  it  is  given  a:  neutral^  ^  variant 

discussed  by  Shprin  et  al.,  1990,;  replaced  by  iU  random  value).  Metc^e  Eich 
(1985,  Pi  2^)  proposes  a  A^arit  inHrhicb^^  mi  item  me  Cither 

encoded  or  hot  (dl-or-rnone  encoding);  :In  all  these  varimits^  boUi  reftotitions  of 
ah  item  and-  ihcreased-stody  time„wiUiimpnjvc:storage:;felative  .ta^variance::ih  the 
system,  and  therefore  increase  performance. 

Shiffrin  et  al.  (1990)  discuss  an  alternative  way  in  which  these  models  might 
show  a  lemning  or  repetitibn  effect.  This  alternative  is‘  based  on  the  fact  that 
performance  in  these  models  is  related  to  the  signal-to-iioise  ratio  (or  d’).  Since 
d’  measures  the  ratio  of  mean  signal  strength  to  the  standard  deviation,  d’  can 
show  an  increase  with  repetition  if  a. constant,  is  added  to- the  standard  deviation. 
The  reason  for  this  is  that  the  standard  deviation  will  no  longer  be  completely 
proportional  to  the  mean  signal  strength;  It  is  natural  to  suppose  that  the 
constant  represents  activation  of  traces  or  trace  components  from  lists  other  than 
the  one  being  tested,  or  from  extra-experimental  memory.  (More  generally,  this 
assumption  may  prove  useful  in  all  models  because  it  lessens  the  effect  of  list 
variables  like  list-length  and  study  time  in  accord  with  the  amount  of  extra-list 
activation). 

The  remaining  models  predict  performance  increases  with  repetitions  or  study 
time  for  fairly  obvious  reasons:  storage  of  stronger  associations  in  SAM  or 
ACT*,  or  weight  changes  that  produce  better  encoding  in  the  neural  net  models. 

2.4.5  Retrieval 

One  bf  the  major  differences  between  the  models  discussed  here  concerns  the 
manner  in  which  the  retrieval  process  produces  a  recalled  item.  In  SAM  separate 
traces  are  accessed  separately,  t^o  the  recovered  infonhation  can  be  cbmpared  to 
a  standard  lexicon;  SAM  doesn’t  provide  any  details  bf  this  process,  but  simply 
assumes  the  probability  of  successful  recall  rises  with  the  strength  of  the  cues  to 
image  relationship.  The  ACT  models  use  siinilar  probabilistic  rules.  MINERVA 
2  also  has  separate  storage  but  retrieves  a  composite.  This  coniposite  could  be 
compared  with  the  individual  stored  traces,  but  this  seems  unsatisfactory  because 
recognition  is  also  assumed  to  be  a  composite  process.  Hintzman  (e.g.  1988) 
proposes  a  somewhat  more  satisfactory  solution  in  which  the  composite  retrieved 
vector  is  used  a  subsequent  retrieval  cue,  the  process  continuing  in  this  way 
until  the  retrieved;  vector  comes  to  represent  an  unambiguous  item.  ART  also 
has  separate  storage,  and  test,  probes  come  to  activate  soiiie  single  node  in.  at 
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neural  net  models  do  not  offer deaf  solution  for  cases  in  which-  fee  retrieved 
trace:  is  ndisy  enou^:  to  be  ambiguous.  Typically  ,a  proibabilistic  recall  rule  is 
adoptefe  based  ph  the  match  of  fee  lehieveditface  to  possible  response^  If  fee 
mddel  is  fully  cdmposite,  however,  itris  hdt  entifply  clear  where  the  cPmparisdh! 
stimuli  Tie. 

A  second  issue  involves  whether  fee  fetiievai'  process  is  ^sumed  to'  be 
probabilistic  of  not.  Both  ACT*  and  SAM  assume'  a  probabilistic  retrieval 
process.  In  these  mddels,  an  iteih  that  was  not  fetfieved  on  a  first  retfieval 
attempt  may  still  be  retrieved  if  an  additional  attempt  at  retrieval  is  made.  (In 
SAM  it  is  usually  assumed,  howeyer,  that  at  least  one  new  cue  must  be  used  for 
a  subsequent  retrieval  to  have  a  chance  at  success).  The  other  models,  on  the 
other  hand,  are  such  that  a  second  attempt  will  always  lead  to  the  same  result 
(unless  the  cues  are  changed,  or  have  added  noise,-  see  McClelland,  19??). 

Finally,  only  a  few  models  (namely  SAM  and  the  convolution/correlation  model 
of  Metcalfe  and  Murdock,  1981)  have  been  applied  to  extended  search  processes 
as  in  free  recall,  in  which  the  subject  uses  a  number  of  different  retrievfe  cues  in 
order  to  maximize  recall.  It  might  be  argued  that  the  search  strategies  that  are 
probably  inwlved  in  these  paradigms  are  not  part  of  the  "basic"  or  "elementary" 
memory  processes.  However,  such  a  viewpoint  does  not  do  justice  to  the  fact  that 
maity  real-life  situations  do  involve  this  type  Pf  unstructured  memory  retrieval. 

2.4.6  Forgetting 

Let  us  define  forgetting  as  a  failure  to  retrieve  information  from  memory  at  time 
B  when  it  was  retrievable  at  an  eartief  tiihe  A,  or  as  a  decrease  in  the  probabil¬ 
ity  of  retrieval;  There  seem  to  be  three  b^ic  ways  iii  which  foigettiiig  might 
occur:  (1)  a  decrease  in  the  "strength"  of  the  meinory  trace,  i;e.  decay;  (2)  ah 
ihvTease  in  competition  by  other,  interfering,  traces  (or  iteins);  and  (3)  a  change 
in  fee  nature  of  fee  cue ‘between  time  A  and  time  B,  i.e.  a  change  in  fee  (func¬ 
tional)  stimulus.  There  does  not  seem  to  be  aiy  difference ’between  the  models 
with  respect  to  fee  third  aspect,  although  not  dl  of  them  have  explicitly  dealt 
with  it.  Mensink  and  Raaijmakers  (1988)  have  used  this  idea  in  their  application 
of  the  SAM  model  to  interference  and  forgetting.  In  this  model,  part  of  the 
forgetting  was  assumed  to  be  caused  by  contextual  changes,  i.e.  changes  in  the 
contextual  cues  between  study  and  test. 
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including  TODi^i)* 

All  the  memory  models  considered  in;  this  , chapter  prc^^  a  decrease  in-perfprin- 
ance  due  to  learning  other  items,  and  to  leaibing  pther  pairs  pf  items  (in  bplh 
AB-AC  and  AB-CD  type  tasks),  the  only  general  exception  occurring  when  the 
other  items  are  rehearsed  or  coded  jointly  with  the  items  in  question.  In  general, 
several  mechanisms  in  each  model  help  produce  interference,,  these  mechanisms 
may  be  different  for  different  tasks>(as  in  SAM)^  and  the  mechanisms  , may  differ 
between  models.  We  mention  here  a  few  of  the  more  interesting  differences 
among  the  models. 

Most  composite  models  incorporate  explicit  interference  due  to  the  superim¬ 
posed  storage  assumptiom.  When  vectors  or  matrices  are  added  together,  or 
when  a  set  of  weights  are  jointly  adjusted  for  each  new  input,  the  result  tends  to 
be  degradation  of  the  representations  of  each  item.  There  are  of  course  excep¬ 
tions  to  this  rule:  If  memory  is  large  enough  relative  to  the  size  of  the  inputs, 
then  storage  might  be  effectively  separate  (the  amount  of  superimpositipn  might 
be  minimal;  see  Kanerya,  1988),  or  if  the  inputs  arc  orthogonal  enough,  or  if  the 
system  orthogonalizes  or  separates  the  inputs  (e.g.  Grossberg’s  ART  models), 
then  interference  would  not  be  mandated  by  the  factor  of  composite  storage. 

The  remaining  sources  pf  forgetting  are  posited  to  arise  during  the  course,  of 
retrieval  (in  SAM  these  are  the  only  sources  of  forgetting).  SAM  assumes 
summation  of  activations  at  retrieval  to  accomplish  recognition;  as  a  conse¬ 
quence,  extra  items  cause  forgetting  by  increasing  ’noise’.  In  MINERVA, 
composition  during  both  recall  and  recognition  causes  interference  due  to 
inaeasirig  noise.  One  chief  remairiing  cause  of  interference  is  based  on  the 
relative  strength  of  storage  of  different  iteriis.  For  example,  , in  SAM,  sarnpling  in 
recall  is  based  on  a  ratio  of  activation-  strengths.  Reduction  in  relative  strengths 
of  targets  due  to  extra  items.alsp  plays  an  important  role  in.rnar^  of  the  models 
under  discussion,  especially  the  ACT  mpdels.  This  factor  plays  a  chief  role  in 
accounting  for  listJength,  fan,  and  cue-oyerload  effects. 
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Inthisf^c^pii,  the  yaripus;mpdels*wili.b.e/c6mp^d}to^ 
cert^.basic  findings^n^mfempry  research, -bptiiiqu^tative:^dq^ 

3.1'  Cued  rwfdl 

Gued  recall  is  the  basic  paradigm  for  associative  memory,  and  the-present.set  of 
models  have  been  formulated  so  that  cued  recall  predictions  can  be  made.  ACT* 
and  CHARM,  havei  been  appUed  morc'  or  less  exdusively  to  cued  recaU  data.  In 
addition,  ACT*  and-  SAM  have  been  shown  to^  be  able  to  account  for  both, 
latency' as  well  as  accuracy  data  in  cued  recall:  ^Anderson,  1981;  Mensink  & 
Raaijmakers,  1988). 

3.1.1  List-length 

All  the  models  are  capable  of  handling  the  basic  list-length  effect.  However,  in 
some  models  (TODAM,  CHARM)  no  distinction  is  made  between  the  to  be 
recalled  list  and  extra-list  and  extra-  experimental  information.  When  list-length 
is  predicted  to  have  an  effect,  it  does  so  because  retrieval  is  restricted  to  the 
to-be-recalled  list  (without  explanation).  This  seems  unsatisfactory,  and  the 
natural  way  to  resolve  the  difficulty  would  be  the  adoption  of  some  fprm  of 
contextual  cuing  (as  is  the  case  with  other  models). 

However,  whether  a  contextual  cue  is  used  may  be  less  important  than  how  it  is 
used.  A  typical  multiplicative  rule  for  cue  combination,  tends  to  focus  access 
upon  regions  of  memory  in  the  intersection? of  the  sets  of  memory  traces  evoked 
by  each  cue  separately,  whereas  a  typical  additive  rule  tends  to  access  traces  in 
the  union  of  these  sets.  Humphreys  et  al.  (1991)  argue  quite  convincingly  for  the 
intersection  approach,  implying  that  "strengths”  or  "activations"  should  be  acted 
upon  in  a  way  functionally  equivalent  to  multiplicatfon  (as  in  the  SAM  model, 
the  Matrix  model,  etc.)  rather  than  addition  (as  in  ACT*). 

Parenthetically  it  might  be  noted  that  thL  type  of  explanation  of  list-length 
effects  sees  such  effects  as  an  example  of  a  more  general  effect,  i.e.  that,  the 
efficacy  of  aiy  probe  cue  is  inversely  related  to  the  number  of  memory  traces  or 
items  that  are  associated  to  that  probe  cue  (which  might  be  called  the  length  of 
the  list  of  associated  items). 

3.1.2  Interference  &  forgetting 

The  basic  issues  here  are  the  effects  of  different  types  of  interference  (i.e. 
AB-AC  vs  AB-CD),  mechanisms  for  (relative)  spontaneous  recovery,  single-list 
forgetting  paradigms  and  whether  or  not  some  sort  of  decay  notion  is  used. 
ACT*,  SAM,  and  CHARM  have  all  been  explicitly  applied  to  such  phenomena. 
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Andersph  (1983a)= Mensink  and  Raaijinidqere  (49^)“sli(iw  tha^ 
in  1^  area:  necessitate^  gumption  ^at  is  based  on  tK)&  relative  and 
absoluteiassddative;stren]^  Relative  strong  is  ;a  jfonctibn'  of  .the  number  and 
streh^  of  other  asspdatidi^#hiler  al^lutd  stien  indexed  by  |he  ampiint 
oi  study  time  or  the  numbdf  of  presentations  of  an  item.  In  Al^*,  absolute 
strength  comes  in  through  the  a^iimption  that  trace  fonnation  ik  more  likely  as 
the  totd  study  time  increases.  In  SAM^  absolute  strengths  ;detennine  the 
probability  of  recovering  enough  information  from  the  trace  to  ^Ve- the  name  of 
the  item  las  a  respbnse. 

Mensink  and  R^jmakers  (1988)  piesent  a  theoretical;  analysis  of.  {traditional, 
interference  phenomena.  They  show  that  modem  membry  models  such  as  SAM^ 
are  able  to  reconcile  phenbmeha  that;  have  been  problematic  for  traditional 
interference  theories.  Such  analyses  bring  out  a  number  of  tacit  assumptions  in 
the  typical  verbal  (i.e.  non-quantitative)  models  that  are  not  usually  noted. 


3.2  Free  recall 

This  paradigm  is  more  complex  than  cued  recall.  This  is  due  to  the  fact  that  it 
not  only  necessitates  an  exact  formulation  of  the  relation  between  STS/worldng 
memory  and  long-term  memory  -but  also  a  description  of  search/retrieval 
strategies.  Only  a  few  of  the  models  have  explicitly  dealt  with  such  data.  We  will 
briefly  discuss  predictions  by  SAM  (Raaijmakers  &  Shiffrin,  1980)  and  an  early 
version  of  the  CHARM  model  (Metcalfe  &  Murdock,  1981). 

SAM  assumes  that  contextual  and  interitem  associations  are  built  up  as  a  result 
of  rehearsing  the  items  in  STS.  A  buffer  process  (Atkinson  &  Shi^n,  1968)  is 
used  to  model  the  rehearsal  process.  Retrieval  starts  by  outputting  any  items  still 
in  STS.  Thereafter  the  retrieval  process  is  modelled  as  a  series  of  retrieval 
attempts  either  with  the  context  ciie  alone  or  using  both  context  and  a  previously 
retrieved  item  as  probe  cues.  This  process  continues  until  the  number  of  failed 
searches  reaches  a  specific  criterion. 

One  of  the  strong  points  of  the  SAM  model  is  that  it  handles  with  a  single  set  of 
parameter  values  data  from  lists  with  large  variations  in  presentation  rate  and 
list  length.  The  latter  result  is  predicted  because  the  search  termination  criterion 
is  exceeded  sooner  for  the  lists,  relative  to  list  length:  relatively  fewer 

samples  are  made  from  a  longer  list  than  from  a  shorter  list.  This  prediction  is 
characteristic  of  sampling-with  replacement  search  models  with  a  fixed  stop 
criterion.  It  also  subsumes  the  cue-overload  principle  proposed  by  Watkins 
(1975;  see  also  Mueller  &  Watkins,  1977;  Watkins  &  Watkins,  1976).  This 
principle  states  that  the  piobability  of  recalling  ary  particular  item  decreases 
with  the  number  of  instances  associated  to  the  retrieval  cue. 
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termu^io^f  of  i^ovoiution^qmlatio^^^^  .the)  .a<^£d;  simulation  model 
doesinpi  in  laaii^the^matiiepiatical’pperatioj^pf 'Convoke 
Listead;.  .aU  orrnpne  asspm  stpredf  j^tween  list  itemsi,.wdV^  list 

items  and  ppntext  (treatedhas  a  pst;item)./;^en  as  a  cupi  a 

random  choice  is  retrieved  peife<4y  from  the.^stprediasso^^  if  ary. 

Ilie  rehears^  process  is  cpnceptualiKd  ia>  continuous  ctting  of  the  mempry. 
vector  vdth  tiie  qirrently  avmlaiBle  item,  ^us,  iwhen  an  item 
associated  tp  the  item  that  is  qurentiy-available  (qintext  at  the  ;s^ 

Then  the  just-presented  item  is  used  as  a  cue  to  generate  an  item  To  w^ch  it 
been  associated,  and  then  thii  item  is  u^d  as  a  :new  cuci  etc.  This:  continues 
until  the  next:  item  is  presented  which,  is  then,  associated  to  the  item  that  is 
currently  avmlable. 

At  the  time  of  recall,  the  last  rehearsed  item  is  recalled  and  used  as , a  cue  to 
generate  another  item;  then  this  item  is  itself  used  as  a  cue,  and  this  continues 
until  a  certain  criterion  amount  of  time  has  passed  without  any  new  items  being 
recalled.  At  that  point,  context  is  reinstated  as  a  cue  and  the  process  begins 
anew  and  continues  until  the  criterion  amount  of  time  passes,  for  the  second  time 
without  aity  new  recalls. 

Each  of  these  models  predicts;  serial  position  effects.  Since  the  SAM  model  is 
based  on  the  two-store  framework,  it  should  not  come  as  a  surprise  that  it  makes 
maity  of  the  same  predictions  as  the  classic  two-store  model  (see  Atkinson  & 
Shiffrin,  1971;  Raaijmakers  &  Shiffrin,  1980),  and  for  the  same  reasons:  primacy 
is  predicted  because  of  the  cumulative  rehearsal  assumption,  while  the  output 
from  STS  leads  to  a  recency  effect.  Although  the  two-store  model  is  often 
described  in  textbooks  as  having  problems  handling  data  on  levels-of-prpcessing 
and  recency  effects,  this  is  in  fact  not  con'ect  (see  Raaijmakers,  1991). 

The  Metcalfe-Murdock  model  has  a  quite  different  flavour.  In  this  model,  the 
shape  of  the  serial-position  curve  is  critically  determined  by  the  cues  that  are 
available  at  recall.  Recency  is  predicted  because  the  last  presented  item  is 
recalled  first  and  then  used  as  a  cue.  This  item  is  assumed’  to  be  the  optimum 
entry  point  into  the  end  of  the  list.  The  disappearance  of  the  recency  effect  by 
the  introduction  of  a  delay  between  presentation  and  test  is  explained  by  the 
assumption  that  rehearsal  continues  during  the  delay.  Hence,  at  the  end  of  the 
delay  the  currently  available  item  will  most  likely  be  some  other  item  than  the 
last  item  on  the  list.  The  optimum  entry  point  for  recall  of  the  .last  few  items  is 
therefore  lost.  This  explanation  seems  uitiikely  since  providing  the  subject  with 
the  terminal  item  after  the  delay  interval  should  reinstate  the  recency  effect. 
(Another  problematic  aspect  is  the  assumption  that  rehearsal  continues  during 
the  delay  filled  with  arithmetic.) 


P^a^^is  ipircdcted^by  'tBis  incidel-  bediusc^’w^  retrie^Vcue^  (in 

the  jsecond  ;pha^  pt  ^eS  -rec^V^rpicess^^^  nearly  'alw^  rdatively 

stron^y  asspqated  mlht  the  ’^t- iteto 

tie  t^ic4  twt^store  e^Matiph  Pf  -primal  -being;  due:  'tp;  stippger.  trace.s^^>M 
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One  of  the  more  important  adv^tages  of  ibe  recent  work  on  models  of  memory 
is  that  it  has  led  to  model-biwed  simulation;  progrtm^  for  spei^c  expenmerital 
tasks.  These  prograim  caii  then  be  used  tb' see  Bow  ^the  model  behaVes^  under 
specific  e^^rimental  conditions.  This  is  especially  important  in  free  recall  since 
this  paradi^’ does  hot  lehdlitseif^^asiiy<tbl^£dytic  approaches; 

One  aspect  of  the  data  where  this  has  been  proven  to  be  very  helpful  is  in  the 
analysis  of  the  effects  of  various  types  of  cuing  manipulations  of  the  likelihood  of 
recall.  We  will  mention  two:  the  (positive)  effects  of  category  cues  and  the 
(negative)  effects  of  cuing  with  randomly  selected  list  items  (the  so-called 
part-list  cuing  effect). 

Raaijmakers  and  Shiffiin  (1980)  showed  that  typical  effects  of  cuing  with 
category  names  could  be  easily  predicted  by  the  SAM  simulation  model.  These 
predictions  do  not  greatly  depend  on  the  specific  assumptions  of  SAM  (vis-a-  vis 
alternative  models).  Such  analyses  are  however  important  to  show  that  observed 
effects  are  indeed  consistent  with  particular  theoretical  frameworks. 

This  is  even  more  the  case  in  the  part-list  cuing  paradigm.  In  this  paradigm 
subjects  are  given  some  randomly  selected  iteihs  from  the  list  as  cues  for  the 
remaining  list  items.  The  typical  finding  is  that  such  cuing  leads  to  a  slight  but 
unexpected  decrease  in  the  probability  of  recall  for  the  remaining  items. 
Raaijmakers  and  Shiffrih  (1981)  spent  a  good  deal  of  effort  analyzing  this 
peculiar  effect  within  their  SAM  simulation  of  free  recall.  They  showed  that  this 
counterintuitive  effect  was  in  fact  predicted  by  the  model.  In  addition  to  the 
basic  result,  a  number  of  related  findings  were  predicted.  These  indue  d  the 
number  of  cues,  the  time  at  which  the  cues  were  given,  arid  the  effect  of 
interpolated  learning  (between  preseritation  arid  test).  Raaijmakers  (1991)  shows 
that  the  model  predicts  a  reversal  of  the  cuing  effect  if  a  delayed  testirig  pro¬ 
cedure  is  used.  This  prediction  is  indeed  borne  out.  This  research  has  also  shown 
that  it  is  by  no  means  easy  to  intuit  the  predictibris  of  a  relatively  simple  model 
such  as  in  a  complicated  expennierital  situation. 

This  part-list  cuing  effect  has  also  been  dealt  with  by  Metcalfe  and  Murdock 
(1981).  However,  iri  their  Emulation  it  was  assumed  that  the  list  cues  were  not 
actually  used  by  the  subject.  This  assuriiption  makes  it  relatively  easy  to  predict  a 
negative  effect  of  cuing  but  does  riot  make  much  sense  given  the  fact  that  most 
subjects  will  expect  the  cues  to  be  helpful  (as  did  most  memory  spedalists).  In 
addition,  such  an  approach  makes  it  impossible  to  predict  a  reversal  of  the  cuing 
effect  in  delayed  cuing. 
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^bte  /however  that  pipWems.  f^e  not-  due  to  -the  b^ic  stracttirc.  of  &e 
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tiohs  that  are  made  cphcernihg  the  subject’s  strate^.  This  illustrates  that 
predictions  for  ftee  recaUitasks  critically  depend  on.  ±e  strategy  that  is  used,  i.e. 
oh/the  assumptions'  that  .are  made^^  of  cues  . to  be  used  in 

retrie^;.  Grohlundiiuid;Shiffirin.(4986),  exaj^e,d;^tlie  etfects  pf  .^ 
strategies  6h;  recaij  fiom;  natui^  ^tegPries; /^d  categori^d  liste  They  showed 
that  differentistrategies  indeed  have  an^effect/pn;  recall/perfornmce.  This  result 
poses  two 'problems  for  aiy  model  for  free  recall,,  First,  it  makes  .it  problematic 
toi  apply  a  sjrrecific  (arbitrary)  version  of  the  model  to  the  data  of  a  group  of 
subjects,,  unless  it  call  be  shpwn,  that  the  result  of  interest  is  insensitive  tp  the 
choice  of  strategy  or  rhat  fhe  subjects  all  use  a  similar  strategy.  Second,  given  a 
specification  of  retrieval  strategies  (i.e.  in  terms  of  the  sequence  pf  cues  that  are 
used),  the  model  should  be  able  to  give  a  quantitative  account  of  the  resulting 
performance  differences.  Gronlund  and  Shiffirin  (1986)  show  that  a  simple 
extension  of  SAM  could  account  for  the  observed  differences. 


3,3  Recognition 

Most  current  models  of  memory  assume  that  sirnple  recognition  decisions  are 
based  on  some  sort  of  global  familiarity  value.  By  this  we  mean  that  the  familiar¬ 
ity  value  is  a  kind,  of  weighted,  additive  combination  of  the  activation  of  all  items 
in  memory.  This  global  familiarity  value  is  determined  by  the  match  between  the 
probe  cues  and  the  memory  trace(s).  This  general  type  of  model  has  been 
termed  the  General  Global  Matching  Model  (GGMM,  Humphreys  et  al.,  1989) 
or  the  Interactive  Cue  Global  Matching  (ICGM)  model  (Clark  &  Shiffrin,  in 
press).  As  these  labels  imply,  such  models  differ  fi'om  previous  local  matching 
models  in  that  all  items  in  memory  are  involved  in  the  niatch,  not  just  the 
representation  of  the  tested  item.  In  this  section  we  will  consider  some  of  the 
data  that  have  been  used  to  test  these  models. 

3.3.1  Pair  recognition 

Pair  recognition  has  been  used  as  an  experimental  paradigm  to  test  aspects  of 
recognition  models.  Basically,  the  issue  here  is  the  way  in  which  associative 
information  is  assumed  to  contribute  to  recognition  decisions.  In  these  experi¬ 
ments  the  subject  first  studies  a  list  of  word  pairs  ...).  'At  test,  intact 

pairs  {Afi^  have  to  be  discriminated  from  rearranged  pairs  (/4jBj),  mixed  (AjX) 
and/pr  from  new  word  pairs  (AT).  These  results  may  be  comp^ed  to  those 
obtained  in  single  item  recognition  (4i  vs.  Ji)  and/or  cued  recognitipn  (AiBi  vs, 
AjX^  where  only  the,  second  item  has  to  be  judged;  see  Clark  &  Shiffiin,  in,  press). 
Humphreys  et  al.  (1989)  show  that  aU.  extant  versions  pf  the  glob^  matching 
model  (SAM,  MINERVA  2,  Matrix,  and  TODAM)  lead  to  similar,  equations  for 
the  mean  matching  strengths.  This  would  seem  to  imply  that  it  wili  be  difficult  to 
differentiate  between  these  models.  However,  predictions  for  d’  depend  not  only 
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the  meim  stiehgths  ph-the  vim  Furthermore,  -t  be'pspssible  to 

dirtinguish  betweeh  ^e=  models  if  one  also  takes  the  ^predictions  for  single  item 
recognition  and  <med  rec6j^tion  mto:  aca>uni. 

Clark  ^d-Shi^nn  (in  press)  exammed^the’predi(^bns^for:^:Q^s  of  recognition 
teste:  iTiey  show  tha^the  models  differ  with' respect  to  whether  they  predict  an 
advantage  for  cued  recognition  compaied  to  ^sinj|le  item  recb^tiom  The  residte 
of  their  experiments  werb  reasonably  well  predirted  by  TODAM  and  SAM  with 
TODAM  producing  the  bert  fit;  MiNER\A  2  and  the  Matrix<»model-  did  not  fit 
the  data  well.  One  problem  with  such  data  however  is  that  it  nught  very  well  be 
the  case  that  subjects  make  use  of  recall  prbcetees  in  additibn  to  glbb^  match¬ 
ing.  That  is,  the  logic  bf  the  models  ^Ibws  subjects  to  supplement  global 
matching  with  recall. 

Gronlund  and  Ratcliff  (1989)  pointed  to  another  problem  for  global  matching 
models.  They  examined  the  time-cburse  bf  the  availability  of  item  and 
associative  information  using  a  response  signal  procedure  (Reed,  1973,  1976; 
Dosher,  1976).  In  this  procedure,  a  recognition  decision  must  be  made  at  one  of 
several  pre-defined  times  after  the  onset  of  the  test  stimulus.  With  this  pro¬ 
cedure  it  is  possible  to  determine  the  growth  of  accuracy  as  a  function  of 
processing  time.  Gronlund  and  Ratcliff  showed  that  item  i^ormation  becomes 
available  sooner  than  associative  information.  This  poses  a  problem  for  global 
matching  models  since  these  treat  these  two  types  of  information  as  inseparable. 
To  accommodate  the  results,  separate  contributions  of  item  and  associative 
information  are  required,  possibly  by  distin^ishing  between  concurrent  and 
compound  usage  of  cues  (see  Gronlund  &  Ratcliff,  1989).  That  is,  it  might  be 
assumed  that  memory  is  probed  in  parallel  with  an  interactive,  compound  cue 
and  with  the  item  cues  separately.  As  an  alternative,  it  might  be  the  case  that 
pair  images  are  sometimes  stored,  and  that  the  time  course  of  pair-image  activa¬ 
tion  differs  from  that  of  single-item  image  activation. 

3.3.2  List-length  vs  list-strength 

Recent  research  by  Ratcliff,  Clark  and  Shiffrin  (1990)  has  focussed  on  the  effects 
of  be  strength  of  other  list  items  on  the  recall  and  recognition  of  target  items. 
This  so-called  "list-strength  effect"  concerns  the  effects  of  strengthening  (or 
weakening)  some  list  items  upon  memory  for  other  list  iteihs.  Ratcliff  et  al. 
(1990)  showed  that  strengthening  some  items  in  the  list  decreases  recall  of  the 
remaining  list  items  but  has  no  or  even  a  positive  effect  on  recognition  pen 
formance.  This  contrasts  with  the  list-length  effect:  adding  items  to  a  list 
decreases  bbth  recall  and  recogiiition  perfomiance.  Thus,  the  number  of  irrel¬ 
evant  items,  but  not  their  strength,  affects  recognition.  This  is  true  hot  only  for 
strength  variations  due  to  amount  bf  study  time  but  alsb  for  \^ations  due  tb 
spaced  repetitions. 
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between  presehtation  of- .t^  ^different  iteim  yefsm  of  ajsm|i(B! 

itenl  ShiJ^  et  ali  :(19^)  showed:  that  raijeht;  memoty  models'  indeed  c^Ot, 
predict  bo^  the  presence  of  a  lisMen^  eitfed  and  the  absence 
a  list-strength  effect. 

ShiflBrin  et  al..  (1990);  also  investigated  whether  t^  various  .models  ^would;  allow 
modification  so;  as  to  be  able  to  prediet  the^  results.  This  does  :n6t  seem  ito  bo 
possible  for  -those  models  that  a^ume  that  items  aie  stored;  in  oiie  composite 
memory  trace.  Even  corwidering  recogiutidn'  O^^^  these  models  c^oti  predict 
both  a  positive  fist-length  effect^^^^  absent  or  negative  list-stren^  effect 
when  strength  variations  are  due  to  spaced  rej^titions.  Models?  such  as  SAM  and 
MINERVA  2  that  assume  separate  storage  are  in:  principle  better  equipped  to 
handle  these  results  although  they  too  will  have  to  be  modified  to  be  able  to 
predict  negative  list-stren^h  effects. 

Shiffrih  et  al.  (1990)  show  that  a  modification  of  SIAM  can  handle  these  results. 
In  this  modified  SAM  model  it  is  assumed  that  different  items  are  stored  in 
separate  traces  but  repetitions  of  an  item  within  a  list  are  stored  in  a  single 
memory  trace.  Second,  the  variance  of  activation  of  each  separate  trace,  when 
the  cue  item  is  unrelated  to  the  item(s)  encoded  in  the  trace,  is  constant 
regardless  of  the  strength  of  the  trace.  This  latter  assumption  is  inconsistent  with 
previous  formulations  of  SAM  but  is  defended  using  a  differentiation  argument: 
the  better  the  image  is  encoded,  the  clearer  are  the  differences  between  it  and 
the  test  item,  and  hence  the  lower  the  activation.  In  this  way,  a  constant  or  even 
decreasing  variance  may  be  predicted,  depending  on  the  weighting  of  context 
and  item  cues. 

A  crucial  aspect  of  this  explanation  is  that  repetitions  of  an  item  are  assuihed  to 
be  stored  in  a  single  memory  image.  To  evaluate  it  further,  Mumane  and 
Shiffrin  (1991)  tested  whether  a.  reversal  of  the  list-strength  effect  in  recognition 
occurs  if  repetitions  are  presented  in  such  a  way  that  they  a^a  likely  to  be 
encoded  in  separate  images.  They  found  that  repetitions  of  words  in  different 
sentences  produced  a  list-strength  effect  whereas  repetitions  of  entire  sentences 
did  not.  This  demonstrates  that  the  nature  of  the  encoding  of  a  repeated  item  is 
a  crucial  factor. 


3.4  Evaluating  the  models 

In  this  chapter  we  have  shown  that  current  mathematical  models  of  memory  are 
capable  of  handling  mai^  classical  and  new  findings  in  recall  and  recognition. 
We  suggest  that  modeL  of  this  type  are  superior  to  verbally  stated  theories  of 
memory.  Arguments  in  favour  of  the  modelling  approach  include,*  (1)  the  ability 
to  predict  the  size  (and  not  just  the  direction)  of  the  effect  of  experimental 
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(4);Ae-fiatt/that/ajmi^elf(esiwciaUy%4theS^ 

used  :tp  ^e^riment^wth.  proceMmg  or  sstraie® 

cmd^'  v^ables  -Aat:  a  :®yen  prediction,  md  (5),'the  ’fad  .tha^ 

often  dembnstrde  the  UimtatioM  of -more  intuitive  feasPning. 

Such-a-  concluMon:  is  hpwever  often  critidzed;  on  ithe;^6imd.-that; ;g^  Models 
of  memPry  of  the  discu^ed  m’  this-  chaptw  sare  too  yersatild  ^at  is,  the; 
models  usually  incb^pfate  1  relative]^  large  ^number  of  procesres  and  :paraine< 
tei^  that<  seem:  to  enable^  them  to  predd  ^Pst  ^  type  ;Of  empiric  result  In 
addtipn,  4t  is  often  difficult  to  intuit  what  a  specific  model:  wll  predid  in  a.  given 
situation..  This  contrasts  with  the  simplidty  of  typical  verbal,  npn-quantitative 
explanations  of  memory  phenomena,  It  will  be  ar^ed  in  this;  sedibn  that  this 
difficulty  is  often  inore  apparentithan^real, 

First,  quantitative  models  also  make  qualitative  predictions  that  do  not  depend 
on  parameter  \^ues.  That  is,  in  order  to  evaluate  a  models  ability  to  predict 
data,  one  should  not  only  examine  the  phenomena  that  it  cm  predict  but  also 
take  into  account  whether  it  niakes  strong,  parameter-free  predictions  about 
results  that  should  not  occur  (iio  matter  what  parameter  values  are  used). 
Second,  if  a  particular  predidion  depends  on  the  specific  parameter  values  that 
are  used,  it  should  be  possible  to  arrange  the  experimental  situation  in  Such  a 
way  that  that  particular  result  is  reversed.  Third,  the  argument  may  also  be 
turned  around:  if  the  ability  of  a  model  to  predid  a  particular  phenomenon  turns 
out  to  depend  on  parameter  value,:,  one  may  well  ask  whether  a  corresponding 
qualitative  explanation  is  in  fad  logically  sufficient.  Finally,  some  results  are 
indeed  complex  (i.e.  dependent  on  a  number  of  interacting  processes)  whether 
we  like  it  or  not.  In  fact,  one  advantage  of  quantitative  models  of  the  type  dis¬ 
cussed  in  this  chapter  is  that  they  may  be  used  to  see  whether  particular  'Wbal" 
explanations  indeed  hold  true  when  tested  in  the  context  of  a  comprehensive 
model  of  human  memory.  The  next  sections  will  focus  on  spedfic  aspects  that 
are  involved  in  this  discussion. 

3.4.1  Number  of  parameters 

Current  quantitative  models  of  memory  frequently  incorporate  a  dozen  or  so 
parameters.  These  parameters  reflect  both  structural  aspects  of  the  memory 
system  (decay  rates,  processing  times)  as  well  as  task-related  aspects  (weighting 
of  cues,  stopping  criteria,  decision  criteria).  When  a  model  is  fitted  to  a  set  of 
data,  these  parameters  usually  have  to  be  estimated  ftom  those  data,  that  is,  they 
are  given  vdiies  so  as  to  bptimize  the  fit  to  the  data.  Although  this  procedure 
can  be  rigourously  defended  on  statistical  grounds,  it  does  ^em  to  mai^  to 
involve  a  bit  of  cheating.  It  is  probably  for  this  reason  that  the  relatively  large 
number  of  parameters  in  current  models  is  frowned  upon. 


Iiijma^.  c^s  hweyeiijjthe  niraW 

tfie:qu^tktivenature^i)f-thejpre^<Hip5|*di^s>notideiwn4,oti 

values;  iffi  -fua^'  0|>the  csim^atibns  sarev  using;  aj  ■sinj^ei  iset  lofj 

p^ameter  N^ues  s(seb^eig;,:Metc^fe 

19i88;v;Hj&Cnii^r  i9^  ■  where  dp-  reveree  ja 

paftidiiiu-  predictioiil  ;bmpiric;4-  bVibenPie^shp  cPneeming/^this; 

prediirti6n;(^e^ei|i  %e  predictidmpf  td'Jrevej^f^pf '  ^ 

M^bn  of  the  ebntexbijd  strengA;  par^feter  ^in 

l99i).  Another  point  is  that  hon-quantitative  ^mbdelsf^alsp  include^^ 

that  is,  degrees  of  6eedom^  althou^  tl^  is  rarely  realized.  To  put  this  in 

Mother  way,  most  e^lauatiohs  for  memoiy  phenomena  by  mpdels^^  mcmpfy 

might  ;be  formulated?  in  a;  qiialitatiye  way:  In  this  way,  there  wouldvhbt  be  any 

basic  difference  between  quantitative-  and  qualitative  models.  tHbweyef,  tthe 

resulting  theories  wbuld?have  lost-most  of?Aeirsexplanatbiy  power; 

3A2  Number  of. pnfcess^ 


Most  of  the  difficulties  Mrith  wellrspecifieri  quantitative  models  have  toodo  with 
the  relatively  large;  number  of  processes  lhai  are  usually  sproposed.  This  is 
especially  the  case  when  models  attempt  to  be  applicable  to  a  large  number  of 
differerit  experimerital  paradigms.  As  erhphasized  by  Smith  (1978),  there  is  a 
tradeoff  between  generality  ahd^  simplicity  of  theoretical  models;  The-  problerh 
here  is  that  due  to  the  number  pf  processes  and  the  nuniber  of  parameters  (or 
quantitative  relations)  involved  in  complex  memory  models,  it  is  often  not 
possible  to  make  predictions  about  the  behaviour  of  the  model  except  through 
quantitativesimulations. 

We  give  an  example  (drawn  from  own  experience)  to  illustrate  this  point.  When 
the  SAM  model  was  first  applied  to  the  part-list  cuing  paradigm  (see 
Raaijmakers  &  Shiffrin,  1981);  it  was  not  at  all  clear  whether  it  would  or  would 
not  predict  this  effect.  ■Furthermore,  even  after  the  prediction  turned  out  to  be 
successful,  it  was  not  immediately  clear  (to  say  the  least)  what  factors,  in  the 
model  were  causing  it 

What  .this«sho^^  is  that  it  is  not  possible  to  .make  intuitive  predictions  concerning 
the  behaviour  that  a  model  will  show  under  specific  task  conditions.  However,,  it 
should  be  evident  that  a  similar  problem  hol^  for  "verbal"  theories  of  memory. 
In  such  qualitative^accounts  it  is  not  clear  what  the  boundary  conditions  are  that 
apply^to  a  particular  prediction.  The  lessoii  that  can  be  drawn  here  is  that  much 
more  effort  should  be  invested  in  theoretical  analyses  of  the  factors  involved  in 
predicting  empirical  phenomena;  Such  analyses  should  focus  on  the.  role  of  each 
of  the  propose'^  processes  in  the  explanation.of  a  particular  phenomenon. 
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:  '  M  A  <  :  ; 

Mthpugji:'^=  flie  modelsii^at'lwsti^ncomidcred' 
iVe:  #atoer,  .therc^^  .artenden^  ;&=  iomenfciwrki  ;to;  restrittysAe 
qualitetive  :pre^ctio^>;Aat;;is,‘s;onci;ia^yzesybniy  iwheffier  4^ 
geneMxdirecdpn  pi^lm^effect,  ^athet^thMl1hej(cx4^rIiia^tude^5In^^^ 

traditipnvpf^the  1960^  ^mdi  497^,.  gpPdiiess-ipf^^^  me^uresi  such 
chi-squaie ;  statis^c.  do  mdt;figu:i!e:ipipnMhcntly;  mv^&c^typ|^^#d^ 
presents  a;fonniy;moderiof.:^^  ,  ... 

TWs  poses,  a  problem;  Oh-^the  one  hand;  be  defended  that  one:  does  npt 
wamt'  to  .focus;  >tpp  specifi^'pJUiy  ;ori  the  met  niuncrical:  det^  one- partiadai 
experiment,  on  the  pthefe  haiid.  it  wuld  be  -desirable  4o  at  least  Ippk  at  the 
relative  ma^tude  of  a  particular  effect  (relative  to  .pther  predicted  effects). 
That  is,  suppose  that  there  are  two  ^phenomena  of  interest:  Effect  A  and  Effect 
B,  where  A  is  a  large  effect  and^  B  a  small  (but  cPiuistent);one;  . It  is:  conceivable 
that  a  model  would  be  able  to  predict  both  A  aiid  B  m  a  qualitative  manner  but 
that  it  would  always;  predictfcither  A  and  B  l^^  or  A  and  B  both  large. 
Siichi a-  "misfit"  would  mot  be  detected  if  the  ?analysis-focuses  oidy-on^the-qualitat- 
ive  aspects.  . 

Fortunately,  most  presentations  of  formal  models  for  memory  employ  a  strategy 
that  is  m  between  these  two  extremes.  The  typical  approachds  to  use  a  single  set 
of  parameters  to  examine  a  set  of  data  (or.  data  patterns)  that  is  representative 
of  empirical  findings.  Although  none  of  the  actual;  data  is  really  fitted  in  the 
traditional  sense,  the  use  of  a  single  set  of  parameters  makes  it  possible  to  verify 
that  the  model  makes  predictions  that  are  in  the  right  ballpark  in  terms  of 
relative  effect  sizes. 

Hence,  we  may  distinguish  between  three  de^ees  of  comparing  the  model  to 
actual  data;  qualitative,  quantitative,  and  what  might  be  called  semi-quantitative 
analysis.  The  first  involves  only  the  direction  of  a  difference  between  conditions; 
the  second  involves  a  direct  comparison  between  the  predicted  and.  observed 
data  using  a  goodness-of-fit  measure;  finally,  the  third  does  not  involve  a 
goodness-of-fit;^measure  but  does  look  . at  the  sizes  of  the  predicted  and  Observed 
effects. 

Although  real  quantitative  fits  remain  a  desirable  feature,  it  might  be  argued 
that  the  proper  approach  is  to  first  aim  for  a  seini-quantitative  prediction  of  the 
data.  In  this  phase,  the  emphasis  is  on  showing  that  a  model  cah  deal  with  a 
variety  of  findings  frpm  different  task  paradigms.  At  some  point,  a  nurhber  of 
promising  models  will  have  been  developed.  At  that  stage,  the  tiihe  seenis  to  be 
ripe  for  quantitative  tests  in  which  several  models  inay  be  compared  in  terms  of 
goodness-of-fit.  We  believe  that  the  demonstrated  potential  of  current  models  of 
memory  justifies  the  expectation  that  future  work  in  this  area  will  involve  more 
comparative,  quantitative  testing. 


(^te:  re^nti^  John,  utoderapm  ;^199d;i  mc  sQm;  .Mdei^n  &  Miisp^  :198|)  Aas 
p]X)ppsed:a:mode^&at  attempts; -ih^a^i^h^^tp'mel^spmeipfthesbestieature^ 
■£he  'approaches  Been  ^  |deMe^  ^quantitative, 

process  models  vs.  geherid,.  yprbai,,  de^ptive  imPdel^;.  ’Rationed’  mOdei. 
^b^asseS  det^  of  irepresentetipn  and^pro  tPrffie^j^itest-^^ 

^d  mstea^i  is  :^e;d  a^  iSvpiganized  so  as 

to  spive  the  iinemprizert  ;problems\’in-  ^  in  ^^;5pvent  retrieval 

situation,  it  is  assumed  that  the  even^  stpied  .iti each  bave  a  number 
assigned  tp= them,  representing^their^probability  of  (containing  the 

desired  information).  It  is  assumed-  that  theK-  eyents^;afe:  Marched  m 
their  reiev^ce,  either  until  a.ietrieval  pccurs  pr  arstPppihg  criteriPh  is  reached. 
The  probabilities  are  based  bn  two  muitipUcativP^  &^  the  ;past.  history  of  an 

evenrt  usefulness  (independent  of  the  <mes  used  to  probe  memory),  and  the 
likelihoods  Of  relev^ice  a^Pciated.  with;  the  cues.  :So  :far  only  the:  bares  of 
applications  to  memPry  paradigms  are  available.  It  is  interesting,  that  the  model 
operates  at  a  very  abstract  level,  and  yet  offers  quantitative  predictions  for 
certain  phenomena.  Although  initial  results  are  intri^ing,  it  is  far  too  early  to 
assess.the  long  run  usefulness  of  the  approach. 
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